CURLER: Finding and Visualizing Nonlinear Correlation Clusters

نویسندگان

  • Anthony K. H. Tung
  • Xin Xu
  • Beng Chin Ooi
چکیده

While much work has been done in finding linear correlation among subsets of features in high-dimensional data, work on detecting nonlinear correlation has been left largely untouched. In this paper, we present an algorithm for finding and visualizing nonlinear correlation clusters in the subspace of high-dimensional databases. Unlike the detection of linear correlation in which clusters are of unique orientations, finding nonlinear correlation clusters of varying orientations requires merging clusters of possibly very different orientations. Combined with the fact that spatial proximity must be judged based on a subset of features that are not originally known, deciding which clusters to be merged during the clustering process becomes a challenge. To avoid this problem, we propose a novel concept called co-sharing level which captures both spatial proximity and cluster orientation when judging similarity between clusters. Based on this concept, we develop an algorithm which not only detects nonlinear correlation clusters but also provides a way to visualize them. Experiments on both synthetic and real-life datasets are done to show the effectiveness of our method.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Visualizing the Clusters and Dynamics of HPV Research Area

Purpose: The purpose of the present study is to visualize HPV clusters’ relationships and thematic trends in the world. Methodology: The research type is an applied one with analytical approach and it has been done using co-word analysis. The population of this study consists of articles’ keywords indexed during 2014-2018 in the Web of Science (WoS) in HPV subject area. The total numbers of th...

متن کامل

Why do Bimetallic Clusters have more Chemical Reactivity? Study the VnNim (2 ≤ n + m ≤ 6) Clusters as the Nano Species

This article gives you proof that bimetallic transition metal clusters with the difference in electronegativity are better catalysts than monoatomic one. To prove this fact, a study of ethylene adsorption on bimetallic clusters vanadium-nickel VnNim (2≤n+m≤6) has been demonstrated. Our result shows that hardness has a quite good linear correlation with the non-Lewis of VnNi (n=1-5) cluster (R2=...

متن کامل

Finding and Visualizing Subspace Clusters of High Dimensional Dataset Using Advanced Star Coordinates

Analysis of high dimensional data is a research area since many years. Analysts can detect similarity of data points within a cluster. Subspace clustering detects useful dimensions in clustering high dimensional dataset. Visualization allows a better insight of subspace clusters. However, displaying such high dimensional database clusters on the 2-dimensional display is a challenging task. We p...

متن کامل

Finding and Visualizing Graph Clusters Using PageRank Optimization

We give algorithms for finding graph clusters and drawing graphs, highlighting local community structure within the context of a larger network. For a given graph G, we use the personalized PageRank vectors to determine a set of clusters, by optimizing the jumping parameter α subject to several cluster variance measures in order to capture the graph structure according to PageRank. We then give...

متن کامل

Visualizing Multiple System Atrophy Studies Based on Collaboration Network and Centrality Indices in Web of Science Database

Introduction: Social network analysis is an analytical method based on graph theories that identifies relationships between individuals or factors to analyze the social structures resulted from those relationships. The objective of this study was to analyze co-authorship and co-word networks based on scientometric indicators and centrality measures in the studies on multiple atrophy system dise...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005